Neural Information Processing Systems

In this paper, we consider the matching problem in multi-agent multi-armed bandits, i.e., while agents prefer arms with higher expected reward, arms also have preferences over agents.